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Abstract 

Mixed evidence on the relationship between school closure and COVID-19 prevalence 
could reflect focus on large-scale levels of geography, limited ability to address endogeneity, 
and demographic variation. Using county-level Centers for Disease Control and Prevention 
(CDC) COVID-I9 data through June 15, 2020, two matching strategies address potential 
heterogeneity: nearest geographic neighbor and propensity scores. Within nearest neighboring 
pairs in different states with different school closure timing, each additional day from a county’s 
first case until state-ordered school closure is related to 1.5 to 2.4 percent higher cumulative 
COVID-I9 deaths per capita (1,227—1,972 deaths for a county with median population and 
deaths/capita). Results are consistent using propensity score matching, COVID-19 data from 
two alternative sources, and additional sensitivity analyses. School closure is more strongly 
related to COVID-19 deaths in counties with a high concentration of Black or poor residents, 
suggesting schools play an unequal role in transmission and earlier school closure is related to 
fewer lives lost in disadvantaged counties. 
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Introduction 


Beginning in February 2020, K-12 schools across the United States began closing in an attempt 
to curb the spread of COVID-19. By March 25, every school district in the country had ended 
in-person instruction (Decker, Peele, and Riser-Kositsky 2020). However, evidence is mixed on 
the extent to which K-12 school closure slows the spread of coronavirus outbreaks (Viner et al. 
2020; Zhang et al. 2020). Influenza transmission is lower when schools are closed for the sum- 
mer or holidays (Chao, Halloran, and Longini 2010; Chowell et al. 2011; Earn et al. 2012; Huang 
et al. 2014; Litvinova et al. 2019; Wu et al. 2010) and school closures substantially reduce the 
number of social contacts among both students and workers (Cauchemez et al. 2008; Huang et al. 
2014; Jackson et al. 2011). Reviews find early school closures can slow influenza transmission 
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when infection rates are higher among children than adults (Jackson, Mangtani, Hawker, et al. 
2014; Jackson, Mangtani, and Vynnycky 2014). 

However, unlike most previous influenza and novel virus outbreaks such as HIN1 (Cauchemez 
et al. 2009; Chao et al. 2010), COVID-19 causes higher symptomatic infection rates at older ages 
(Nogrady 2020; Rasmussen and Thompson 2020; Wu et al. 2020; Zhang et al. 2020) and trans- 
mission rates from young children are relatively low (Park et al. 2020). K-12 school closures may 
therefore be weakly related or unrelated to the spread of COVID-19. School closure has high 
costs for children’s development and mental health, as well as for parents and communities 
(Coibion, Gorodnichenko, and Weber 2020; Orben, Tomova, and Blakemore 2020; Organisation 
for Economic Cooperation and Development 2020; UNESCO 2020). If school closures do little 
to prevent the spread of COVID-19, other public health measures that do not inhibit children’s 
development may be a better approach to control the pandemic. 

In addition to age, COVID-19 has unequal implications by race, ethnicity, and poverty. Black 
and Hispanic Americans are more likely to contract COVID-19 and experience severe symptoms 
than White Americans (Alcendor 2020; Centers for Disease Control and Prevention [CDC] 2020a; 
Zelner et al. 2020). Low-income individuals and communities also face higher risk of severe illness 
from COVID-19 (Koma et al. 2020; Wiemers et al. 2020), due in part to lower economic ability to 
practice social distancing (Jay et al. 2020). The relationship between school closure and COVID-19 
spread may differ by race, ethnicity, or poverty due to segregation and unequal health risks and 
resources (Carratala and Maxwell 2020; de Brey et al. 2019; Egede 2006; Phelan and Link 2015; 
Phelan, Link, and Tehranifar 2010; Williams 2012; Williams, Lawrence, and Davis 2019). Because 
school closures are a universal intervention, they may be especially effective at curbing the spread 
of COVID-19 in Black, Hispanic, and low-income communities. 

Country-level analyses of the relationship between school closure and COVID-19 inci- 
dence or mortality yield mixed results (Banholzer et al. 2020; Flaxman et al. 2020; Liu et al. 
2020), but state-level analyses within the United States suggest school closure is related to 
lower COVID-19 mortality (Auger et al. 2020; Rauscher 2020). The different results could 
reflect the different geographic level of analysis. COVID-19 is transmitted interpersonally 
and smaller-scale analyses may provide a better estimate of the relationship between trans- 
mission and non-pharmaceutical interventions (NPIs) such as school closure. Existing work 
has limited ability to address potential endogeneity in the relationship: national or state politi- 
cal leadership, for example, could influence both NPI implementation and COVID-19 preva- 
lence or mortality. 

It remains unclear whether or to what extent school closure timing is related to COVID-19 
spread or whether the relationship differs by race, ethnicity, or poverty. To help weigh the eco- 
nomic and social costs against the health benefits of closing schools (Cauchemez et al. 2009) 
during an outbreak with higher infection rates at older ages, this study: 


1. Estimates the relationship between state K-12 school closure timing and COVID-19 
deaths and cases within matched county pairs in different states and with different school 
closure timing. County pairs are identified using two matching strategies: nearest geo- 
graphic neighbor and propensity score matching. 

2. Estimates whether the relationship between state K-12 school closure timing and county 
COVID-19 deaths vary by county poverty, race, and ethnicity. 


Results suggest later school closure is related to higher COVID-19 deaths per capita and that the 
relationship varies by poverty and race/ethnicity. Closing schools later is related to higher 
COVID-19 deaths in counties with high concentrations of poor and Black residents. Results have 
implications for our understanding of the economic and social costs of school closures. 
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Background 
Schools and Public Health 


Education became a stronger determinant of individual and social life in the United States 
throughout the 20th and 21st centuries (Fischer and Hout 2006). Institutionalization and expan- 
sion of formal education has shaped culture, the workplace, religion, economy, and family, 
among other aspects of life (D. P. Baker 2014; Fischer and Hout 2006). Educational expansion 
has incorporated a greater portion of the population for longer periods, which has independent 
effects on other social institutions. Institutional theories of education suggest schools are an inde- 
pendent force in society, instilling cultural, organizational, and economic changes through ideas 
(e.g., about efficiency, appropriateness) and socialization (e.g., national identity, civic duty; D. P. 
Baker 1999, 2009, 2014; Cremin 1951, 1961; Meyer, Ramirez, and Soysal 1992; Meyer and 
Rubinson 1975; Meyer et al. 1979; Ramirez and Boli 1987). 

This institutional perspective makes clear the potentially important role of schools in public 
health efforts to control an infectious outbreak. However, while previous work emphasizes the 
importance of education through culture and ideas, schools also play a role through in-person social 
gatherings. For example, in addition to developing and disseminating knowledge, schools also 
develop social networks (Small 2009). Weeden and Cornwell (2020) illustrate the “small world” 
network at a medium-sized university in the United States. Using simulations, the authors found that 
most students are connected to one another through multiple transmission paths. Complex systems 
of social ties may cause schools to be particularly efficient spreaders of infectious disease. 

K-12 schools similarly create social networks, with children, teachers, and staff interacting in 
classes, cafeterias, buses, and playgrounds. K-12 students and employees typically reside at 
home and travel daily to school, connecting school and family networks. By reducing in-person 
interactions and overlap of social networks, school closure could reduce opportunities for 
COVID-19 transmission. Alternatively, families could substitute other social gatherings for 
school (e.g., church school, play dates, or informal shared family childcare). When schools close, 
families may engage in alternative social activities, yielding a null or positive relationship 
between school closure and COVID-19 transmission. For example, household networks may 
expand during school closures to include older family members for childcare, which could theo- 
retically increase symptomatic COVID-19 infections and deaths, due to higher susceptibility at 
older ages (Wu et al. 2020; Zhang et al. 2020). 

Existing evidence about the relationship between school closure and COVID-19 transmission is 
mixed (Auger et al. 2020; Banholzer et al. 2020; Flaxman et al. 2020; Haug et al. 2020; Liu et al. 
2020). The mixed results could reflect different geographic levels of analysis. For example, 
national-level analyses may be more likely to find a weak relationship because COVID-19 is trans- 
mitted locally through person-to-person interactions (Banholzer et al. 2020; Flaxman et al. 2020). 
However, a comparison of thousands of NPIs in 226 countries finds that school closure is strongly 
related to transmission (Haug et al. 2020). The mixed results could also reflect limited ability to 
address potential endogeneity in the relationship between school closure and COVID-19 transmis- 
sion. Unobserved characteristics such as political leadership could influence both school closure 
timing and COVID-19 prevalence or mortality. County-level analyses that apply matching strate- 
gies to address potential endogeneity contribute to existing evidence about the relationship between 
transmission and school closure at a finer level than state or national analyses. 


Schools and Inequality 


Horace Mann (1849)—a 19th-century Massachusetts politician and champion of the common 
school—called education “the great equalizer of the conditions of men” (p. 59). However, this 
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ideal role of education is far from reality. A common argument is that education could increase 
equality if children enjoyed equal access to education. Research examining labor market out- 
comes often finds greater equality of opportunity among those with higher levels of education 
(particularly, a college degree; Bernardi and Ballarino 2016; Breen and Jonsson 2007; Hout 1988; 
Torche 2011). However, when accounting for selection, recent work finds that higher education 
does not increase equality of opportunity (Fiel 2020; Zhou 2019). Even 19th century compulsory 
schooling laws did not increase equality of opportunity in the labor market (Rauscher 2016). This 
evidence raises questions about whether or in which contexts education increases equality. 

Similar questions arise about the relationship between education and health (Cutler and Lleras- 
Muney 2008). Education is significantly related to multiple health outcomes for both adults and 
children (Abuya et al. 2012; Currie and Moretti 2003; Goldman and Smith 2011; Gunes 2015; 
Hahn and Truman 2015; Ross and Mirowsky 1999). However, those who are more likely to attain 
higher levels of education may also be healthier for a variety of other reasons, including social 
background, behavior, or neighborhood exposure to pollution. Evidence is mixed when account- 
ing for selection into education. Extended compulsory schooling laws reduced mortality rates and 
improved health (Arendt 2005; Lleras-Muney 2005; Oreopoulos 2007) and greater access to col- 
lege increased women’s educational attainment and health of their children at birth (Currie and 
Moretti 2003). However, several rigorous studies that use natural experiments to account for 
unequal selection into education find no effect of education on health (Clark and Royer 2013; 
Dursun, Cesur, and Mocan 2018; Malamud, Mitrut, and Pop-Eleches 2018). 

Mixed evidence of the relationship between education and health could partly reflect the focus 
on individual-level outcomes. Education has spillover effects, with social benefits beyond the 
individuals who gain more education. For example, educational expansion may spur state or 
national economic growth and occupational upgrading (Lutz, Cuaresma, and Sanderson 2008; 
Rauscher 2015). Similarly, education may have effects on population or public health, beyond its 
individual-level effects (Benjamin-Chung et al. 2018; Berniell, de la Mata, and Valdes 2013). For 
example, schools may play a role in transmission of infectious disease. Because schools gather 
students from diverse families and backgrounds in one place, schools could allow infectious 
diseases such as COVID-19 to spread more quickly. 


Inequality and Health 


Because of systemic race and income inequality, school closure could affect COVID-19 deaths 
differently across groups. The fundamental cause argument explains how social inequality pro- 
duces health inequality. This theory suggests that those in economically and racially advantaged 
positions have greater access to flexible resources, which they use to maintain better health com- 
pared to those with fewer resources (Phelan and Link 2015; Phelan et al. 2010). Unequal resource 
distribution means that health inequality by race and income persists through multiple pathways 
(see e.g., Chetty et al. 2016), and cannot be ameliorated by individual interventions. The funda- 
mental causes of racism and income inequality manifest in the unequal distribution and experi- 
ences of COVID-19. 

Low-income, Black, and Hispanic Americans experience higher rates of COVID-19 infection 
than White and high-SES Americans (CDC 2020a; Centers for Medicare & Medicaid Services 
[CMS] 2020; Raine et al. 2020; Zelner et al. 2020). According to the CDC (2020a), Black 
Americans contract COVID-19 at 1.4 times the rate of Whites, while Hispanic or Latino indi- 
viduals contract the virus at 1.7 times the rate of Whites. Evidence also suggests that low-income 
people are much more likely than wealthier people to contract COVID-19. Data from the Centers 
for Medicare and Medicaid Services (CMS) show that people on Medicare who are poor enough 
to qualify for Medicaid are four times more likely to test positive for the coronavirus or be hos- 
pitalized from it than their high-income peers (CMS 2020). Barriers to physical distancing might 
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explain higher infection rates among poor, Black, and Hispanic individuals. These groups are 
more likely to be essential workers and live in more densely populated or crowded areas which 
puts them at higher risk for exposure to the virus (Arasteh 2020; Chen and Krieger 2020; Jay 
et al. 2020; Rogers et al. 2020). 

In addition to being more likely to contract the virus, Black and Hispanic individuals are also 
more likely to experience severe symptoms or death (CDC 2020a). There are several explana- 
tions for this which stem from systemic and long-standing inequalities in employment opportuni- 
ties and conditions, living conditions, health, and discrimination (Alcendor 2020; CDC 2020a; 
Williams 2012; Williams, Lawrence, and Davis 2019; Williams, Lawrence, Davis, et al., 2019). 
First, Black Americans have a higher incidence of chronic disease (e.g., obesity, hypertension, 
diabetes, asthma) than White Americans which makes them more susceptible to severe COVID- 
19 infections (Carratala and Maxwell 2020; Wiemers et al. 2020; Williams 2012). Second, Black 
and Hispanic Americans are more likely to be un- or under-insured, have less access to health 
care, and receive worse medical treatment, all of which likely make recovery from COVID-19 
more difficult (Alsan, Garrick, and Graziani 2019; Bor, Cohen, and Galea 2017; Carratala and 
Maxwell 2020; Egede 2006). 

Because school closures are a universal intervention and poor, Black, and Hispanic Americans 
are most affected by COVID-19, we expect school closures to be more strongly associated with 
COVID-19 deaths in counties with high percentages of poor, Black, or Hispanic residents. As 
argued by the theory of fundamental causes, health inequalities are likely to persist when inter- 
ventions focus on individual risk factors because those with more resources use them to leverage 
new opportunities (Phelan et al. 2010). In contrast, an intervention that is not resource-dependent 
(such as closing schools) should yield universal benefits, because it has the ability to reduce 
mortality rates regardless of individual resources. Furthermore, current data suggest that poor, 
Black, and Hispanic individuals are at greater risk of contracting and dying from COVID-19, and 
that low-income neighborhoods and majority-nonwhite neighborhoods have a higher incidence 
of COVID-19 deaths. If school closures are an effective intervention to reduce COVID-19 mor- 
tality, we expect this to benefit areas that have been hit harder by the virus: areas with high con- 
centrations of low-income, Black, or Hispanic residents. 

Alternatively, since low-income Americans and Black and Hispanic individuals are more 
likely to be exposed to the coronavirus from other sources (such as employment and neighbor- 
hood factors), school closures could be more weakly related to COVID-19 for these groups. 
Following from fundamental cause theory, because it does not address the root cause of racial or 
income inequality, school closure may have similar implications regardless of demographics and 
may not be sufficient to reduce racial disparities in health outcomes. 

Zajacova and Lawrence (2018) emphasize the importance of social context for understanding 
the relationship between education and health and leveraging it to improve population health and 
reduce health disparities. Using a finer level of geography than previous research and examining 
variation by county poverty and race/ethnicity, this paper quantifies the relationship between 
school closure timing and county-level COVID-19 deaths to provide more information when 
considering the high social costs of school closures (Coibion et al. 2020). At the same time, this 
paper provides a new analysis of the institutional theory of education (D. P. Baker 2014) by 
examining the relationship between schools and public health during a pandemic. 


Data and Methods 


Using daily cumulative data on COVID-19 deaths and cases by county from the Centers for 
Disease Control through June 15, 2020, this study examines dates after the first case in each 
county through the end of the academic year. June 15 includes dates through the end or within a 
few weeks of the end of the academic year in most states. June 15 also predates the publicized 
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interventions into CDC data by the Trump administration, which began in July when they trans- 
ferred COVID-19 data collection responsibility from the CDC to the Department of Health and 
Human Services (Stolberg 2020). COVID-19 prevalence measures are similar using county-level 
data from the CDC, The New York Times (NYT), and Johns Hopkins University (JHU). Mean 
values of deaths and cases per capita from NYT and JHU differ by less than 2 percent from CDC 
measures (see Table A1) and are highly correlated with CDC measures (0.995 or higher). Results 
are similar using NYT or JHU data (see Online Appendix Tables A12—A15). 


Key Measures 


COVID-19 prevalence is measured as the logged number of deaths (and cases) per 100,000 resi- 
dents. Time to school closure is the number of days from the first case in each county until 
schools were closed based on state-ordered school closure dates from EdWeek (2020a). All states 
closed schools within 11 days after President Trump declared a national state of emergency on 
March 13. Variation in prevalence rates on March 13 (0—56.3 cases per 100,000 residents; 0-1.4 
deaths per 100,000 residents) allows examination of the relationship between COVID-19 spread 
and school closure timing. Online Appendix Table Al provides information about each measure 
used in the analyses. 


Analytic Approach 


State school closure decisions could partly reflect state COVID-19 prevalence, potentially bias- 
ing state estimates. State closures are not based on prevalence in each specific county and school 
closure timing is therefore less likely related to prevalence and more exogenous at the county 
level. (Correlations between state school closure date and COVID-19 cases and deaths per capita 
[log] on the date of school closure are 0.30 and -0.28 at the state level and 0.16 and -0.01 at the 
county level, respectively.) 

Two matching strategies are used to address additional heterogeneity between counties: near- 
est geographic neighbor matching and propensity score matching. Matching is conducted using 
observed characteristics and unobserved characteristics may still differ after matching. These 
two strategies are complementary and results should be consistent using both approaches. First, 
National Weather Service geocode data for county centroid location is used to identify nearest 
neighboring county pairs in different states and the distance between those county pairs. Counties 
in close geographic proximity to each other have similar potential COVID-19 exposure and simi- 
lar economic and demographic contexts. However, state school closure timing is more likely 
exogenous with respect to county COVID-19 prevalence among nearby counties in different 
states. Online Appendix Table A2 Panel A compares variation among all counties compared to 
within county pairs less than 100 miles apart in different states. Variation within pairs is lower on 
all measures than in the unmatched sample (by 37—138 percent, depending on the measure). 
Mean differences between paired counties are less 0.1 standard deviations on all measures. 

Second, propensity score matching is used to identify county pairs with similar characteristics 
by school closure timing (Abadie et al. 2004; Rosenbaum and Rubin 1985). Specifically, we use 
psmatch2 (Leuven and Sianesi 2003) to identify the five nearest-neighbor matches between 
counties above and below the median of seven days from the first case in the county until states 
closed schools. 

We estimate propensity scores using multiple county- and state-level measures (see Table A2 
Panel B). Table A2 Panel B and Figure Al show evidence of balance on observed characteristics 
between treatment and control counties within matched pairs (using pstest, Leuven and Sianesi 
2003). The balance tests in Table A2 Panel B compare mean values, standardized percentage bias 
(difference in means divided by square root of the variance; Rosenbaum and Rubin 1985), and 
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variance ratios (ratio of variance of residuals in treatment and control groups for each variable 
regressed on the propensity score; Rubin 2001). Tests of overall balance and for each variable are 
within the ranges recommended by Rubin (2001). 

Using the propensity score from psmatch2, we use two approaches to estimate the relationship 
between late school closure and COVID-19 deaths: (1) we create strata (deciles) of propensity 
score and estimate the relationship within each strata (Long and Kurlaender 2009) and (2) we use 
the nearest neighbor match for each county with late school closure and estimate the relationship 
within county pairs. We limit the propensity score analyses to those in the range of common sup- 
port on the propensity score (excluding 405 counties with propensity score>0.9628). 


Statistical Analyses 


As with other matching approaches, the geographic and propensity score matching techniques do 
not address unobserved variation, but they adjust for observed heterogeneity and allow us to 
compare counties with similar observed characteristics other than school closure timing (Dehejia 
and Wahba 2002; Morgan 2001). We use the matched counties in regressions with fixed effects 
for county pair or propensity score strata to estimate the relationship between the continuous 
measure of school closure timing and COVID-19 deaths within similar counties. 

Equation | predicts log cumulative deaths per capita in county (i) in pair (7) as of June 15, 
2020 with time to school closure and county pair fixed effects. Within pairs of matched counties, 
exp(, ) estimates the percent increase in deaths for each additional day from a county’s first case 
until the state closed schools. Standard errors are adjusted for county pair clustering. 


In(COVID19 Deaths /100k),, = 0 + B,Timeto School Closure, +B, Xj, + Pair, + €; (1) 


We estimate the relationship separately within geographic pairs and propensity score pairs. We 
use a similar approach to estimate the relationship within propensity score strata (7), adjusting 
standard errors for clustering within strata. When using geographic pairs, models are fit with and 
without the following controls (_X, ) to address potential confounding: county-level population 
(log), population density (log), percent Black, percent Hispanic, percent Asian, percent urban, 
median income (log), poverty rate (from U.S. Census Bureau), percent of residents who fre- 
quently wear a mask in public, percent who always wear a mask, percent who rarely/never wear 
a mask (from NYT survey, July 2020); state-level number of public schools (log), public school 
enrollment (log), school closure date (from EdWeek 2020a); and median age, percent group quar- 
ters residence (from IPUMS; Ruggles et al. 2020), number of long-term care facilities with 
COVID-19 cases at the end of May (from Kaiser Family Foundation 2020; mean value assigned 
to seven non-reporting states). States implemented other physical distancing measures in addi- 
tion to school closure. Controls therefore include dates of state: stay-at-home order, non-essential 
business closure, and restaurant closure (coded as last observed date for states without each 
policy; from Raifman et al. 2020; results are similar using stay-home dates from Wall Street 
Journal and Aljazeera as of April 21). In the propensity score analyses, we fit models with and 
without controlling for propensity score and controls from the geographic analyses not used to 
create the propensity score (X, ). 


Heterogeneity by Poverty, Race, and Ethnicity 


Based on U.S. Census Bureau data, we create indicators for whether counties are above the 
median values of percent poverty (14.1 percent), Black (2.0 percent), Hispanic (3.3 percent), 
American Indian (0.4 percent), Asian (0.5 percent), and white (85.8 percent) residents. We also 
create indicators for whether counties are above 70th percentile of poverty rate (17.3 percent) to 
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distinguish counties with concentrated poverty. Equation 2 is similar to Equation 1, predicting 
log cumulative deaths per capita in county (/) in pair (7) with time to school closure and county 
pair fixed effects, but adds an interaction term between time to school closure and the indicator 
for high poverty rate (or high proportion of residents by race/ethnicity). Within pairs of matched 
counties, exp(B, ) estimates the percent increase in deaths for each additional day from a county’s 
first case until the state closed schools in low-poverty counties. Exp(f, ) estimates the additional 
percent increase in deaths for each day until school closure in high-poverty counties and tests 
whether the relationship between school closure timing and COVID-19 deaths differs by county 
poverty rate. Standard errors are adjusted for county pair clustering. 


In(COVID19 Deaths / 100k), = «+ B, Time to School Closure, + B,High Poverty, + 


B,Time to School Closure,*High Poverty, +B,X + Pair, + €; (2) 
As in the main analyses, we estimate the relationship separately within geographic pairs, propen- 
sity score pairs, and propensity score strata (standard errors adjusted for strata clustering). We fit 
models with and without the same controls (_X, ) in the main analyses. 


Sensitivity Analyses 


Geographic matched pair estimates shown (in Figure 2) are limited to county pairs less than 100 
miles and less than 40 miles apart, but other distance thresholds yield consistent results. Counties 
can be matched multiple times, resulting in multiple observations per county (mean is 2 observa- 
tions; range 1-10 in geographic pairs). Results are consistent when weighted by the inverse of the 
number of observations for each county (i.e. weighting each county to represent one observation) 
or when adjusting standard errors for county rather than pair clustering. 

Estimates within county pairs could be biased if states closed schools based on the number of 
cases in all of its counties or if cases were evenly distributed throughout states and states closed 
schools based on state prevalence. Estimates could be downwardly biased if counties or districts 
closed schools at dates other than the state-mandated closure date. Because of potential bias, 
estimates are associational and interpreted using associational language throughout the paper. 
However, Oster (2019) bounds are used to identify the degree of selection on unobservables 
required to explain the estimates (i.e. required to make coefficients zero). 

We replicate analyses using COVID-19 data from NYT and from JHU (2020; New York 
Times 2020). We use alternative measures of the dependent variable, including number deaths 
or cases (log) unadjusted for population size, when excluding the 7 states that recommended 
rather than ordered schools closed, when excluding the 5 counties in New York City (see 
Online Appendix Tables Al—A3), and when including counties with zero deaths (by adding 
one death per one hundred thousand residents), which are excluded when log-transforming 
deaths per capita. 

We repeat the propensity score analyses when matching on additional variables: % Black, 
median income (log), and state median age and restaurant closure date. Overall balance statistics 
did not meet recommended levels when including these measures, so the main analyses control 
for them in regressions rather than in propensity score calculation. Propensity score analyses are 
unweighted, but results are consistent when weighting by the frequency a county is matched 
divided by five (to normalize weights across the five potential matched counties). 

States implemented several other physical distancing measures during the COVID-19 pan- 
demic. Controls include dates of these measures, but it is difficult to isolate the role of school 
closures alone. A step in this direction is to limit analyses to county pairs with the same dates of 
other NPIs. 
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Figure |. Lowess graphs of (log) daily cumulative state COVID-19 deaths per 100,000 residents over 
time by timing of state non-pharmaceutical interventions. 

Note. Late Closure includes states in the top tercile of days from 100 cases until each measure. Early Closure includes 
states in the bottom tercile of days from 100 cases until each measure. Panel A: Late Schoo! Closure—on or after 
the day a state reached 100 cases. Early School Closure—5 or more days before reaching 100 cases. Panel B: Late 
Restaurant Closure—2 or more days after reaching 100 cases. Early Restaurant Closure—4 or more days before 
reaching 100 cases. Panel C: Late Stay-Home Order—16 or more days after reaching 100 cases. Early Stay-Home 
Order—less than 7 days after reaching 100 cases. Alternative stay-home order dates from Wall Street Journal and 
Aljazeera show consistent results. Panel D: Late Business Closure—I|3 or more days after reaching 100 cases. Early 
Business Closure—less than 5 days after reaching 100 cases. Quadratic graphs with confidence intervals indicate 
significant differences in Panels A and B and overlap in Panels C and D. 


COVID-19 prevalence and its relationship to school closure could change over time. 
Sensitivity analyses predict daily cumulative county COVID-19 deaths per capita with daily 
observations from the first case in each county. These analyses add date and county fixed effects 
in Equation | to account for stable differences between counties and national changes over time. 
Models are fit with and without controlling for a 1-day lagged measure of deaths per capita. 

We predict COVID-19 deaths and deaths due to all causes separately by race/ethnicity. 
However, the distribution of deaths by race/ethnicity are suppressed by the CDC in many coun- 
ties due to small numbers to protect privacy and data are only available for approximately 400 
counties. Significant differences (Paternoster et al. 1998) between B, coefficients in Equation | 
predicting Black and Hispanic deaths compared to those predicting white deaths would indicate 
that the relationship between school closure timing and death rates differs by race or ethnicity. 
Due to the small number of counties with mortality data by race/ethnicity, this approach is pro- 
vided as a sensitivity check. 


Results 


Figure | compares cumulative state COVID-19 deaths per capita over time by timing of school 
closure, restaurant closure, business closure, and stay-home orders. States that closed schools late 
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have substantially higher deaths per capita than states that closed schools early. In contrast, there 
is little difference in deaths per capita by stay-home order or business closure timing. Restaurant 
closure timing is related to differences in deaths per capita. Based on these preliminary analyses, 
we use dates of these NPIs (in addition to other measures) as controls or to match counties in 
statistical analyses. 

Table 1 shows estimates using geographic nearest-neighbor county pairs in different states 
with variation in school closure timing. Within nearby county pairs, one additional day before 
school closure is related to 1.5 to 2.4 percent higher county COVID-19 deaths per capita (an 
additional 1,227—1,972 deaths for a county with median deaths per capita and median popula- 
tion). Figure 2 shows estimates limited to counties less than 100 miles apart, less than 40 miles 
apart, and with and without controls, which include county- and state-level characteristics. 
Results are consistent when predicting COVID-19 cases and when limited to counties at other 
maximum distances. 

Table 2 shows estimates when matching counties based on propensity score, with similar 
observed characteristics but with different school closure timing. Within the same propensity 
score strata, one additional day before school closure is related to 1.2 to 1.6 percent higher county 
COVID-19 deaths per capita (an additional 980—1,310 deaths for a county with median deaths 
per capita and median population). Within propensity score pairs, there is no relationship between 
school closure timing and deaths when not including controls, but the estimate is similar to other 
analyses when including controls: an additional day before school closure is related to 1.7 per- 
cent higher county COVID-19 deaths per capita (an additional 1,392 deaths for a county with 
median deaths per capita and median population). Propensity score models predicting COVID- 
19 cases add further support for a relationship to school closure timing. 

Table 3 shows results examining variation in COVID-19 deaths by county poverty, race, and 
ethnicity. Coefficients for the interaction between school closure timing and high proportions of 
poor or Black residents suggest variation by poverty and race, but not by ethnicity. For example, 
Models 5 and 6 in Panel A (within geographic county pairs) suggest minimal relationship between 
school closure and COVID-19 deaths in counties below median percent Black residents. 
However, in counties with a high proportion of Black residents, an additional day before school 
closure is related to 1.9 to 2.3 percent higher county COVID-19 deaths per capita. Propensity 
score analyses similarly find a significantly stronger relationship between school closure and 
deaths in counties with a high concentration of Black residents (marginal estimates illustrated in 
Figure 3). Poverty interaction terms are not statistically significant within geographic pairs. 
However, propensity score analyses suggest school closure is more strongly related to COVID- 
19 deaths in high poverty counties (above the 70" percentile of poverty rate). Specifically, each 
additional day before school closure is related to approximately | percent higher COVID-19 
deaths in low-poverty counties, but this relationship is significantly higher (1-3 percent) in high 
poverty counties. Interaction terms are not significant for counties above the median poverty rate, 
suggesting the relationship emerges in counties with high poverty rates. Interaction terms are also 
not significant for counties with a high proportion of Hispanic residents. Thus, results indicate 
variation in the relationship between COVID-19 deaths and school closure by poverty and race, 
but not by ethnicity. 


Sensitivity Analyses 


Sensitivity analyses find consistent results when predicting the number of deaths or cases (log) 
unadjusted for population size, when excluding the 7 states that recommended rather than 
ordered schools closed, or when excluding the 5 counties in New York City (see Online Appendix 
Tables A3—AS5). 


Rauscher and Burns 841 


Table |. Predicted County COVID-19 Deaths/Cases per 100,000 Residents within Geographic County 
Pairs. 


Panel A: County Pairs <100 Miles Apart. 


(1) Q) 3) (4) 
Deaths/| 00k (log) Cases/100k (log) 

Variables Baseline *Controls Baseline *Controls 

Days from first 0.018** 0.015* 0.022** 0.01 6** 
case until school (0.005) (0.006) (0.002) (0.002) 
closure 

Constant 2.817** |,670.333** 5.608** 361.738 

(0.019) (482.489) (0.018) (220.399) 

Observations 1,452 1,452 3,322 3,320 

Number of county 726 726 1,661 1,660 
pairs 

R-squared 710 768 774 820 

County pair fixed Y Y Y Y 
effects 

Controls ¥ Y 


Panel B: County Pairs <40 Miles Apart. 


(I) (2) (3) (4) 
Deaths/ 100k (log) Cases/ 100k (log) 
Variables Baseline *Controls Baseline *Controls 
Days from first case until school closure 0.024** 0.024* 0.01 7** 0.01 1** 
(0.008) (0.012) (0.002) (0.003) 
Constant 2.944 2,332.964** 5.677** 589.500 
(0.032) (843.829) (0.022) (405.333) 
Observations 492 492 1,040 1,040 
Number of county pairs 246 246 520 520 
R-squared 775 842 832 868 
County pair fixed effects ¥ Y Y Y 
Controls Y Y 


Note. Predicted CDC COVID-19 data by county as of June 15, 2020 using EdWeek school closure data. Limited to 
county pairs in different states with unequal school closure timing. Counties are matched |-10 times with mean of 

2 matches. Even numbered models include controls for county population (log), population density (log), percent 
Black, percent Hispanic, percent Asian, percent urban, median income (log), poverty rate (from U.S. Census Bureau), 
percent of residents who frequently wear a mask in public, percent who always wear a mask, percent who rarely/ 
never wear a mask (from NYT survey, July 2020); state number of public schools (log), public school enrollment 
(log), school closure date (from EdWeek); median age, percent group quarters residence (from IPUMS), number of 
long-term care facilities with COVID-19 cases at the end of May (from Kaiser Family Foundation; mean value assigned 
to seven non-reporting states), stay-at-home order, non-essential business closure, and restaurant closure (coded 

as last observed date for states without each policy; from Raifman et al. 2020; similar results using stay-home dates 
from Wall Street Journal and Aljazeera as of April 21). Robust standard errors adjusted for county pair clustering in 
parentheses. CDC = Centers for Disease Control and Prevention; IPUMS = Integrated Public Use Microdata Series. 
*b <.1.*p < .05. **p < 01. 


Results are also consistent when including counties with zero deaths, which are excluded 
when log-transforming deaths per capita. Table A6 shows results when assigning a value of 
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Figure 2. Predicted COVID-19 deaths and cases per 100,000 residents within geographic county pairs. 
Note. Estimates based on models in Table |. Within nearest neighboring county pairs in different states with different 
school closure timing, each additional day from a county’s first case until state-ordered school closure is related 

to 1.5 to 2.4 percent higher cumulative county COVID-19 deaths per capita as of June 15, 2020 (an additional 
|,227—1,972 deaths for a county with median deaths per capita and median population). Estimates are B, coefficients 
from Table | with 95 percent confidence intervals (standard errors adjusted for county pair clustering). Analyses are 
limited to counties <100 miles or <40 miles apart (alternative distance thresholds yield consistent results). 


one death per 100,000 residents to counties before the first death occurred to include all 
matched counties. 

On the whole, results shown in Tables A7—A9 are consistent when limited to county pairs with 
the same dates of restaurant closure, stay-home order, or business closure. The main analyses 
control for the timing of other physical distancing measures and these sensitivity analyses add 
further confidence that school closure is independently related to COVID-19 spread. However, 
small samples sizes among propensity score pair analyses (Table A9) indicate the limitations of 
this approach. 

As a robustness check, Table A10 shows results examining the relationship between COVID- 
19 prevalence as of December 15, 2020 and state Fall 2020 school opening policies (from EdWeek 
2020b). Compared to counties in states that began Fall 2020 K-12 schooling in-person, those in 
states that began at least partially closed had 20 to 53 percent lower cumulative COVID-19 
deaths as of December 15, 2020. 

Table All shows results from models predicting daily cumulative county COVID-19 deaths 
per capita with date and county fixed effects, weighted by the inverse of the number of observa- 
tions for each county to prevent counties where the first case emerged earlier from driving results. 
Controlling for lagged COVID-19 deaths, results suggest each additional day before school clo- 
sure is related to 0.1 percent faster daily growth in COVID-19 deaths. 

Tables A12—A15 show results when using COVID-19 data from the NYT and JHU. Results are 
consistent with the main analyses. We repeat propensity score analyses when matching on four 
additional variables: % Black, median income (log), and state median age and restaurant closure 
date. Results are similar to the main analyses (shown in Online Appendix Tables A16—A17). The 
main analyses control rather than match on those measures to achieve the recommended balance 
levels (Rubin 2001). 

Aconcern in interpreting results is that school closure timing may be endogenous. Unobserved 
characteristics could influence both school closure and COVID-19 deaths. Using Oster (2019) 
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Table 2. Predicted County COVID-19 Deaths/Cases per 100,000 Residents within Counties Matched 
Based on Propensity Scores. 


Panel A: Propensity Score Strata. 


(1) (2) (3) (4) 
Deaths/ 100k (log) Cases/100k (log) 
Variables Baseline *Controls Baseline *Controls 
Days from first case until 0.01 6* 0.012* 0.01 9** 0.013* 
school closure (0.006) (0.004) (0.006) (0.005) 
Constant 2.548** 75.627 5.865** —108.024* 
(0.041) (48.831) (0.042) (42.790) 
Observations 1,446 1,446 1,446 1,446 
number of strata 10 10 10 10 
R-squared 046 164 025 211 
County strata fixed effects Y Y Y Y 
Controls Y Y¥ 
Panel B: Propensity Score Pairs. 
(I) (2) (3) (4) 
Deaths/100k (log) Cases/100k (log) 
Variables Baseline * Controls Baseline * Controls 
Days from first case until 0.000 0.017** 0.01 6** 0.0177** 
school closure (0.004) (0.005) (0.004) (0.005) 
Constant 2.468** 31.294 5.855** —146.624 
(0.035) (95.456) (0.034) (107.435) 
Observations 1,226 1,226 1,226 1,226 
number of county pairs 613 613 613 613 
R-squared 79 .580 493 610 
County pair fixed effects Y Y Y Y 
Controls Y Y 


Note. Predicted CDC COVID-19 data by county as of June 15, 2020 using EdWeek school closure data. Limited to 
counties within range of propensity score support. Even numbered models include controls for county propensity 
score, percent Black, median income (log), state median age, and date of restaurant closure (coded as last observed 
date for states without; from Raifman et al. 2020). Robust standard errors adjusted for county strata (Panel A) or 
county pair (Panel B) clustering in parentheses. CDC = Centers for Disease Control and Prevention. 

*p <.1.*p < .05. *p < 01. 


bounds, estimates for school closure timing predicting deaths per capita remain positive when 
assuming potential selection on unobservables is 13 percent higher than selection on observables 
within county pairs less than 100 miles apart (6 = 1.13) and over seven times selection on 
observables within pairs less than 40 miles apart (6 = 7.22). Within propensity score strata, esti- 
mates remain positive when assuming selection on unobservables is nearly eight times selection 
on observables within strata (6 = 7.97). Bounds are calculated by assigning a maximum r-squared 
value of 1.3*estimated r-squared (the value at which 90 percent of randomized estimates survive; 
Oster 2019). Bounds are appropriate for coefficients that decrease when adding controls and 
therefore are not calculated for estimates based on propensity score pairs. 
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Figure 3. Predicted COVID-19 deaths per 100,000 residents within propensity score strata. 

Note. Marginal estimates at sample mean values based on odd-numbered models in Table 3, Panel B. Within 
propensity score strata, the relationship between school closure timing and predicted county COVID-1I9 deaths per 
100,000 residents is higher in counties with high poverty rate and high proportion Black residents. 


Conclusion 


Although children do not appear to be the primary spreaders of COVID-19 (Lee and Raszka 
2020; Park et al. 2020), earlier school closure is related to lower deaths and cases per capita in 
U.S. counties. Previous inconsistent evidence on the relationship between school closure and 
COVID-19 prevalence could reflect a focus on large-scale levels of geography, limited ability to 
address endogeneity, and variation by poverty, race, or ethnicity. We build on previous work by 
examining county-level data and applying two matching strategies to address potential heteroge- 
neity: nearest geographic neighbor and propensity score matching. Within nearest neighboring 
county pairs in different states with different school closure timing, each additional day from a 
county’s first case until state-ordered school closure is related to 1.5 to 2.4 percent higher cumu- 
lative county COVID-19 deaths per capita. Propensity score analyses suggest that one additional 
day before school closure is related to 1.2 to 1.7 percent higher county COVID-19 deaths per 
capita. For a hypothetical county with median population and deaths per capita, this amounts to 
980 to 1,972 deaths for each day before school closure. 

We find that school closure is more strongly related to COVID-19 deaths in counties with a 
high concentration of Black or poor residents. Schools may therefore be unequal opportunity 
spreaders, playing an unequal role in transmission. The stronger relationship could reflect 
unequal school resources (if schools in advantaged counties can leverage resources to prevent 
spread during in-person schooling; B. D. Baker et al. 2018; Sosina and Weathers 2019), unequal 
family or individual socioeconomic resources, and higher risk of COVID-19 exposure and severe 
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symptoms due to economic and social inequality and preexisting health conditions (Alcendor 
2020; CDC 2020b; Wiemers et al. 2020; Williams 2012; Williams, Lawrence, and Davis 2019; 
Zelner et al. 2020). 

School closure creates a larger burden for families—especially those with few resources, but 
it is a relatively universal intervention which does not require family resources to access or 
adopt. The universal nature of school closure may account for its stronger link to COVID-19 
deaths among Black and low-income Americans (Phelan and Link 2015; Phelan et al. 2010). 
Results do not differ in counties with high concentrations of Hispanic residents, which could 
reflect the lower rate of preexisting health conditions related to severe COVID-19 symptoms 
(Wiemers et al. 2020) and the Hispanic health paradox, which yields better health than expected 
for both individuals and communities (Hummer et al. 2007; Markides and Coreil 1986; Shaw 
and Pickett 2013). 

School closure has social, economic, psychological, and cognitive costs for individuals and 
societies (Coibion et al. 2020), with potentially long-term implications, which is why states 
rarely close schools. In the context of the first wave of COVID-19; however, results suggest 
school closure is related to lower death rates. Estimates are robust to multiple sensitivity analyses 
and remain positive when assuming potential selection on unobservables is larger than selection 
on observables—in some analyses over 7 times larger (Oster 2019). The costs are high, but in the 
absence of a robust test and trace containment strategy, earlier school closure was related to lower 
COVID-19 deaths during the first outbreak in the United States. 

Although other physical distancing policies are not the focus of this study, descriptive 
results echo existing work (Friedson et al. 2020; Lin and Meissner 2020), which provides 
mixed evidence for a relationship between COVID-19 spread and stay-at-home orders and 
non-essential business closures. This may be because those measures were generally imple- 
mented later than school closures. Restaurant closure shows some relationship with COVID- 
19 deaths. Future research could directly examine restaurant closure compared to other physical 
distancing measures. 

An important limitation of this study is that it cannot establish a causal relationship. Two 
county matching strategies are used to reduce the possibility that unobserved heterogeneity could 
account for the results. However, matching is necessarily based on observed characteristics. 
Similar results using both geographic and propensity score matching strategies reduce the pos- 
sibility that random variation or selection explain the findings, but additional research is required 
to establish a causal relationship. 

Evidence from the above analyses supports institutional theories of education, which suggest 
schools are an independent force in society (D. P. Baker 2009, 2014). While work in this area has 
focused on the importance of education in shaping culture and the economy, results of this study 
add to evidence that schools can also play a role in public health through in-person social gather- 
ings (Weeden and Cornwell 2020). The role of schools likely differs by the specific disease 
examined, but results suggest public health implications of schools can spillover and extend 
beyond those who enter schools to include other community members (Benjamin-Chung et al. 
2018; Berniell et al. 2013), including older individuals. 

Furthermore, this study contributes to our understanding of the relationship between educa- 
tion and inequality. Recent work finds that education does not increase equality of opportunity in 
the labor market (Fiel 2020; Rauscher 2016; Zhou 2019) and has no effect on health (Clark and 
Royer 2013; Dursun et al. 2018; Malamud et al. 2018). However, the majority of studies focus on 
individual-level outcomes, omitting important spillover effects and social benefits of education. 
Results of this study add to evidence of the link between education and population or public 
health (Benjamin-Chung et al. 2018; Berniell et al. 2013). Results suggest schools may play a 
role in transmission of infectious diseases. In the case of COVID-19, schools are more strongly 
related to death rates in counties with a high proportion of Black or low-income residents. This 
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evidence is consistent with a recent finding that school closure is unrelated to COVID-19 preva- 
lence in areas with low infection rates, but may help reduce spread in areas with high infection 
rates (Harris et al. 2021). Our results suggest school closure may be related to lower COVID-19 
deaths in disadvantaged areas. Balancing the unequal learning opportunities created by closing 
schools (Aucejo et al. 2020; Bacher-Hicks, Goodman, and Mulhern 2021; Chetty et al. 2020; 
Kuhfeld et al. 2020; Lancker and Parolin 2020) with the potential for lower mortality among 
disadvantaged populations poses an ethical challenge. 
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